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The evaluation mechanism of pattern matching with dynamic patterns is modelled in the Pure Pattern 
Calculus by one single meta-rule. This contribution presents a refinement which narrows the gap 
between the abstract calculus and its implementation. A calculus is designed to allow reasoning on 
matching algorithms. The new calculus is proved to be confluent, and to simulate the original Pure 
Pattern Calculus. A family of new, matching-driven, reduction strategies is proposed. 



Introduction: Dynamic Patterns 

Pattern matching is a basic mechanism used to deal with algebraic data structures in functional pro- 
gramming languages. It allows to define a function by reasoning on the shape of the arguments. For 
instance, define a binary tree to be either a single data or a node with two subtrees (code on the left, in 
ML-like syntax). Then a function on binary trees may be defined by reasoning on the shapes generated 
by these two possibilities (code on the right). 

let f t = match t with 
type 'a tree = . _ 

Jr . Data d -> <codel> 

Data a 

' , ^ , , I Node (Data d) r -> <code2> 

Node of 'a tree * 'a tree 

I Node 1 r -> <code3> 

An argument given to the function f is first compared to (or matched against) the shape Data d (called 
a pattern). In case of success, the occurrences of d in <codel> are replaced by the corresponding part 
of the argument, and <codel> is executed. In case of failure of this first matching (the argument is not a 
data) the argument is matched against the second pattern, and so on until a matching succeeds or there is 
no pattern left. 

One limit of this approach is that patterns are fixed expressions mentioning explicitly the constructors 
to which they can apply, which restricts polymorphism and reusability of the code. This can be improved 
by allowing patterns to be parametrised: one single function can be specialised in various ways by in- 
stantiating the parameters of its patterns by different constructors or even by functions building patterns. 
For instance in the following code, the function f would take an additional parameter p which would 
then be used to define the first two patterns. In this case, instantiating p with the constructor Data would 
yield the same function as before, but any other function building a pattern can be used for p! 

let f p t = match t with 

I p d -> <codel> 

I Node ( p d) r -> <code2> 

I Node 1 r -> <code3> 

However, introducing parameters and functions inside patterns deeply modifies their nature: they 
become dynamic objects that have to be evaluated. This disrupts the matching algorithms and intro- 
duces new evaluation behaviours. This paper intends to give tools to study these extended evaluation 
possibilities. 
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The Pure Pattern Calculus (PPC) of B. Jay and D. Kesner HJK09I |Jay09[ models the behaviour of 
dynamic patterns by using a meta-level notion of pattern matching. The present contribution analyses the 
content of the meta pattern matching of PPC (reviewed in Section [TJ, and proposes an explicit pattern 
matching calculus (Section [2]) which is confluent, which simulates PPC, and which allows the descrip- 
tion of new reduction strategies (Section 3.1 1. An extension of the explicit calculus is then discussed 
(Section [3T2]> before a conclusion is drawn. 



1 The Pure Pattern Calculus 

This section only reviews some key aspects of PPC. Please refer to [JK09 ] for a complete story with 
more examples. The syntax of PPC is close to the one of A -calculus. The main difference is the re- 
placement of the abstraction over a variable Xx.b by an abstraction over a pattern (with a list of matching 
variables) written [6]p -+ b. There is also a new distinction between variable occurrences x and match- 
able occurrences x of a name x. Variable occurrences are usual variables which may be substituted while 
matchable occurrences are immutable and used as matching variables or constructors. 

t ::= x | x | tt | [d]t -> t PPC Terms 

where 8 is a list of names. Letter a (resp. b, p) is used to indicate a term in position of argument (resp. 
function body, pattern). 

As pictured below, in the abstraction [6]p -> b the list of names 6 binds matchable occurrences in 
the pattern p and variable occurrences in the body b. Substitution of free variables and a-conversion 
are deduced (see [JK09] for details on PPC, or Figures [T] and [2] for a formal definition in an extended 
setting). 

\x^xx x = a [y] xy -> yx 

One of the features of PPC is the use of a single syntactic application for two different meanings: 
the term t\t% may represent either the usual functional application of a function t\ to an argument t% or 
the construction of a data structure by structural application of a constructor to one or more arguments. 
The latter is invariant: any structural application is forever a data structure, whereas the functional appli- 
cation may be evaluated or instantiated someday (and then turn into anything else, including a structural 
application). 

The simplest notion of pattern matching is syntactic: an argument a matches a pattern p if and only 
if there is a substitution a such that a = p a . However, with arbitrary patterns, this solution generates 
non-confluent calculi BvOo90ll . To recover confluence, syntactic matching can be used together with 
a restriction on patterns, as for instance the rigid pattern condition of the lambda-calculus with pat- 
terns MKvOdV081 . The alternative solution of PPC allows a priori any term to be a pattern, and checks 
the validity of patterns only a posteriori, when pattern matching is performed. In particular, the restriction 
on patterns applies only once the evaluation of the pattern is completed. This allows a greater freedom 
of evaluation and a greater polymorphism of patterns, and hence a greater expressivity. 

This is done by a more subtle notion of matching, called compound matching, which tests whether 
patterns and arguments are in a so-called matchable form. A matchable form denotes a term which is 
understood as a value, or in other words a term whose current form is stable and then allows matching. 
Matchable forms are described in PPC at the meta-level by the following grammar: 

d ::= x | dt PPC data structures 

m ::= d \ [d]t ^ t PPC matchable forms 
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Compound matching is then defined (still at the meta-level) by the following equations, taken in order. 



We *} 

{{ai<3 2 /e P1P2} 
U/e P} 
U/e P} 



{iH-a} if x G 8 

{} ifxge 

-{[ai jg pij l±) /g P2§ if ai«2 and p\P2 are matchable forms 
_L if a and p are matchable forms, otherwise 

wait otherwise 



Its result, called a match and denoted by p, may be a substitution (written a), a matching failure 
(written _L) or the special value wait. The latter case represents undefined cases of matching, when the 
pattern or the argument has still to be evaluated or instantiated before being matched. 

Decomposition of compound patterns in the equations above is associated with an operation ttJ of 
disjoint union which ensures linearity of patterns: no matching variable should be used twice in the same 
pattern, or confluence would be broken [KI08O]. Its formal definition is: 

• LtJ is commutative. 

• _L l±l p = _L for any p (even wait). 

• wait l±)p = wait for p ^1. 

• (7i l+J (72 = -L if domains of 0\ and 02 overlap. 

• (5\ ttJ a% is the union of 0\ and a 2 otherwise. 

Finally, PPC has to deal with a problem related to the dynamics of patterns: a matching variable 
may be erased from a pattern during its evaluation. In this case, no part of the argument would be bound 
to this matching variable and then no term would be substituted to the corresponding variable. Hence 
free variables would not be preserved, which would make reduction ill-defined (see Example [TJ. This is 
avoided in PPC by a last (meta-level) test, called check: the result {a/g p} of the matching of a against 
p is defined as follows. 

• if {We p}} = 1 then {a/g p} = 1. 

• if {{a jg /?}} = a with dom(a) 7^ 6 then {a jg p} = _L. 

• if {{a jg p\ = a with dom(a) = 6 then {a jg p} = a. 

Remark that {a jg p} is not defined if {{a jg /?}} = wait. 

Finally, the reduction — >ppc of PPC is defined by a unique reduction rule (applied in any context): 

([e]p^b)a b^/'ti 

where for any b and a the expression b c denotes the application of the substitution a to the term b, and 
b L denotes some fixed closed normal term _L. 

Example 1. Let t be a PPC term. The redex {[x]cx -> x) (ct) reduces to t: the constructor c matches itself 
and the matchable x is associated to t. On the other hand, {[x,y\cx -> xy) {ct) reduces to _L: whereas 
the compound matching is defined and successful, the check fails since there is no match for y and the 
result would be ty where y appears as a free variable. The redex ([x]cx x) (c) also reduces to _L since 
a constructor will never match a structural application. And last, {[x]yx -> x) (ct) is not a redex since the 
pattern yx has to be instantiated. 
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2 Explicit Matching 



This section defines the Pure Pattern Calculus with Explicit Matching (PPCem), a calculus which gives 
an account of all the steps of a pattern matching process of PPC. The first point discussed is the iden- 



tification of structural application (Section 2.1 ). An explicit calculus is then fully detailed (Section 2.2 1 



and some of its basic properties are proved (Section 2.3 1. Explicit formulations of simpler pattern calculi 
already appear in HCK041 lFor02l ICFK04I . 



2.1 Explicit Data Structures 

Firstly, a new syntactic construct is introduced to discriminate between functional and structural appli- 
cations (as in [FMS06] for the rewriting calculus for instance). Any application is supposed functional a 
priori, and two reduction rules propagate structural information. The explicit structural application of t 
to u is written t • u. 

t ::= x\x\tt\ft \ [6]t -> t PPC, terms 

d ::= x\t»t PPC, data structures 

xt — >, x»t 
(ti»t 2 )t 3 — {h»t 2 )»h 

The identity morphism embeds PPC into PPC,. The subset of PPC, defined by PPC is referred to as 
the set of pure terms. On the other hand, a "forgetful" morphism maps PPC, terms back to PPC terms 
(or pure terms): 

14 ■■= x 
ffl ■= * 

ItM := [/life] 
[ti.fc] := IhjM 
l[d]p^b\ := 

Some PPC, data structures are not mapped to data structures of PPC, for instance ([6]p -+b)»a. 
However, for any pure term t, if t — y* t' and t' is a PPC, data structure, then t is a PPC data structure 
(proof by induction on t). One can also observe that for every PPC data structure t, there exists a 
reduction t — >l t' with t' a PPC. data structure. Call well-formed a term t such that [f] — ►* t. 



2.2 Explicit Pattern Matching 

Another new syntactic object has to be introduced to represent an ongoing matching operation. The basic 
information contained in such an object are: the list of matching variables, a partial result recording what 
has already been computed, and a representation of what has still to be solved. 

This new object is called matching and is written (0|ju|A) with 6 a list of names, pt a decided 
match (that means, _L or a substitution), and A the collection of submatchings that have still to be solved 
(a multiset of pairs of terms). For now on, we will consider only decided matches, written /I (wait does 
not exist as such in PPCem)- 

The complete new grammar is: 

t ::= x | x | tt | /•/ | [6]t -+t \ t (0|jU|A) PPCem terms 

d ::= x\ft PPCem data structures 

m ::= d\[6]t->t PPCem matchable forms 
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The set of free names of a term t is fn[t) = fv(t) Ufm(t). 


Free variables 






= \x\ 


fv(x) 


= 


J v \ 1 




Mh»t 2 ) 


= ^(^ 1 )u/v(^ 2 ) 


fv{[d\p^b) 


= Jv(p)U(Jv(b)\e) 


Mt(e\ii\A)) 


= (fv(t)\e)Ufv(codom(il))uMA) 


Free matchables 




fm(x) 


= 


fm(x) 


= {4 


fm{t\t 2 ) 


= fm{h)U fm{t 2 ) 


ftn(ti»t 2 ) 


= fm(h)Ufm(t 2 ) 


fm{[d]p^b) 


= (fm(p)\6)Ufm(b) 


fm(t{6\n\A)) 


= fm{t)Ufm{codom{il))Ufm{n x (A)) U {fm{n 2 {A)) \ B) 


where if A= (ai,pi)...(a„,p„) then fm(n x (A)) = U,-/m(a,-) and fm(n 2 (A)) = {J i fm(p i ). 



Figure 1: Free names of a PPCem term 



{tu) a 

(fu)° 
{[9]p^b)° 

{t{d\n\A)Y 



Ox 

X 



([6}p°^b°) 
t a (d\il a \A a ) 



x £ dom(o) 
x dom(o) 



dr\(dom(o)Ufn(o)) 
d n (dom(o)U fn(o)) 








where in A a (resp. ii a ) the substitution propagates in all terms of A (resp. of the codomain of ii). 



Figure 2: Substitution in PPCem 
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Initialisation 

([B)p^b)a S'b b(e\<d\{a,p)) 

Structural application 

X? 7". 

Matching 

Since A has been defined as a multiset of pairs of terms, its elements are not ordered. In the following 
rules (a,p)A denotes the (multiset) union of A with the singleton {(a,p)}. 

The first three matching rules are for successful matching steps. 

b (6\jJ,\(a,x)A) — > m (0|ju tb) {jc I— >- a}|A) if x G B and fn(a) n 6 = 

b(B\n\(x,x)A) — > m b(e\n\A) ifxgd 
b(6\n\(a 1 »a 2 ,pi»P2)A) — >m &(0|jll|(ai,pi)(a2,P2)A) 



The last six matching rules are for failure, and could be summed up as "for any other matchable forms a 
and p,\etb(6\n\(a,p)A) reduce to b (0|_L|A)". 

b(0\n\($,x)A) — > m b(d\±\A} 

b{6\n\( ai *a 2 ,x)A) — >« b(d\±\A) 
b(d\^\([d a ]p a ^b a ,x)A) ^ m b(d\±\A) 

b(e\n\(x, P1 »p2)A) b(e\±.\A) 
b(e\^\([e a ]p a ^b a , Pl *p2)A} — > m b(e\±\A) 

b(6\ix\(a,[6 p ]p p ^b p )A) — > m b(6\±\A) 

Resolution 

b(6\o\<d) — > r b° if dom(a) = 6 (substitution rule) 

b(d\o\<d) ^ r J_ \idom{c)^e 
b(e\±.\A) — > r j_ 



if x S" 6 and x^y 

ifxgd 

ifxgd 



Figure 3: Rules of PPCem 
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A pure term of PPCem is a term without any structural application or matching (that means a PPC 
term). As in PPC, the symbol _L used as a term denotes a fixed closed pure normal term. 

Free variables and matchables are defined in Figure [T] as a natural extension of PPC mechanisms 
to explicit matching. Similarly, a notion of (meta-level) substitution is deduced from this definition 
(Figure[2]). Finally, a notion of a-conversion is associated, and from now, on it is supposed that all bound 
names in a term are different, and disjoint from free names. 

New rules for matching are of three kinds: an initialisation rule — >b which triggers a new matching 
operation, several matching rules — > m corresponding to all possible elementary matching steps and 
three resolution rules — >,- that apply the result of a completed matching. The complete set of rules of 
PPCem is given in Figure|3] 

Reduction — >em of PPCem is defined by application of any rule of — >b, — >,, — > m or — > r in 
any context. The subsystem — > p = — >, U — > m U — > r computes (when possible) already existing 
pattern matchings but does not create new ones. 

2.3 Confluence and Simulation properties 

This section states and proves four theorems on basic properties of PPCem and its links with PPC. The 
first one is a result on the normalization of already existing pattern matchings. 

Theorem 1. — > p is confluent and strongly normalizing. 
Proof. 

• We define two well-founded orders -< f jy and -<y, whose lexicographic product contains p < — . 
This will enforce strong normalization. 

- -<jy sorts terms with respect to the nesting of matchings. It is based on an over-approximation 
of the depth of potentially nested matchings (matchings that are syntactically nested or that 
may become such after some substitutions). For any lists of names 6j, decided matches /!,-, 
and lists of pairs of terms A,-, the sequence (0i|/^i|Ai) ;...;(6 n \^l n \A n ) is called a potentially 
nested chain of length n if for each i 6 { 1 . . .n — 1 } one of these conditions holds : 

* Nesting: |jU,- + i |A,- + i) appears in A, or in the codomain of fr. 

* Potential nesting: a variable of appears in A,- or in the codomain of ju,-. 

The set of maximal chains of a term t is the set of all potentially nested chains that can be 
built using the matchings appearing in t and that can not be extended (neither by the left nor 
by the right) using other matchings of t. For this extraction, remember that all bound names 
in t are supposed to be different, and disjoint from free names. The depth of t is the multiset 
of the lengths of the maximal chains of t. 

Example 2. Write t = c (0|0| (x,c)(x,c)) {x\ x i— > y (y|0|(c,y)) |0). The term t contains 
three matchings and has one maximal chain of length 3, which is 

(0|0|(x,c)(x,c)); (x\x^y (v|0| (£,?)> 1 0) ; (y 1 1 (c,y) > 

The reduction t — > r t' = c (0|0| (vi (yi |0| (c,yi)} ,c) (y2 (y2|0| {c,fi)) ,c) ) yields a new 
term t 1 which still contains three matchings ( one was reduced and disappeared but another 
one was duplicated) and admits two maximal chains of length 1, namely 

<0|0| (y x (yi|0|(£,/i)> ,c) (y 2 (y 2 |0|(c,/ 2 )) ,c) ) ; (yi |0| (c,yi)> 

<0|0| (yi (yi|0|(c,yi)) ,c) (y 2 <y 2 |0|(c,y 2 )) ,c) > ; (y 2 10| (c,y 2 ) > 
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The usual order on natural integers gives a well-founded order on the lengths of potentially 
nested chains, -<js is defined as the multiset extension of this order, applied to the depths of 
terms. It strictly decreases for any reduction by the substitution rule, and is less or equal for 
any other reduction. 
- -<y is the natural order on the size of terms, defined as follows: 

y(x) := 1 
y(x) := 1 
y{ht 2 ) := y{h) + ,9>{t 2 )+2 

y(h.t 2 ) : = y( h )+y(t 2 ) + \ 
y([e]p^b) ■= y{ P )+y{b) 

where we write e G# A when the element e appears in the multiset A with multiplicity k. 
-<y strictly decreases for any reduction except by the substitution rule. 

• Matching rules generate some critical pairs, most of which are trivially convergent. The most 
subtle case is the reduction of a non linear matching: 

(0|;UW{xi->-ai}|(a2,jc)A) p < — (0|/^|(ai,x)(a 2 ,x)A) — > p (6\fi l±l {x \-t a 2 }\ {a\ ,x)A) 

Since tt) is a disjoint union of substitutions, both sides can be reduced to (0|_L|A). 
Finally, — > p is weakly confluent, and then confluent by Newman's Lemma [Ter03|. 

□ 

The second theorem states the confluence of — >em- Since the reduction of PPCem is defined by 
several rules, the result does not fall into the modular framework of [ JK09 ]. It is proved here directly by 
the Tait and Martin-Lof 's technique. The main construction of the proof is the definition (in Figure [4]) 
of a parallel reduction relation enjoying the diamond property (Lemma[3]). The relation is first 
linked to — >em in Lemma [T] 

Lemma 1. — > E m Q =>- Q — >* E m 
Proof. 

• — >em ^ ==$■ by induction on the definition of — >em- 

• =>• C — y* EM by induction on the definition of ==>. 

□ 



Lemma 2. Ift =>• t' and a =>• a' then t a =>• t 
Proof. By induction on the derivation of t => t'. 
Lemma 3. <^==^ C =^<^= 



□ 



Proof. Suppose t\ <= t ==>■ t 2 . Induction on the derivations of t ==> t\ and t t 2 : 

• If one of the reductions is by "Id", the conclusion is immediate. 

• If one reduction is by a "Cgr" rule, and the other by a "Cgr", "Init", "Struct", or "Match" rule, then 
the induction hypothesis applies straightforwardly. 
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Id. 



Cgr. 



Init. 



Struct. 



Match. 



Res. 



t =>f 



=^ t[ ?2 =^ ?2 ? 1 = ^ > Z l f 2 =^ t'i p ==> p' b =^> ft' 

tl t 2 =>t[t 2 tvt 2 =>t[»t' 2 [d]p ->■ ft \0}p' V 

b^b' M A => A' 



p ==> p' b ft' a o 
([d]p^b)a^b'(d\d>\(a',p')) 



Xf=^x»f' (?l»fa) *3 => ( f i*4)**3 

ft' =>• ju' a =>• a' A =>• A' 



ft(0|ju|(a,x)A) =^ 1/ <0 ttl {jc •->■ A') 

b ft' M => A => A' 



i£ ejn(a)ne = 



ft(0|jU|(x,x)A) =>«/<Oj/* , |A / ; 



A' 



ft(0|At|(ai.a2,pi.p 2 )A)^ft , (0|At / |W,/ 1 )(a 2 ,p 2 )A / ) 
ft' A A' 



&(0|/i|(fl,p)A) =^>ft'(0|±|A 



— a and p other matchable forms 



^^J O m(a) = t 6 *<0|JL|A>=>± 

b (e\a\<d)^(b') a b(e\o\®)=>± "w/^^ 

As in Figure [3] the last "Match" rule could be explicited in six fail rules. 
Parallel reduction is straightforwardly extended: 

• to decided matches (p.) by applying =>• to all terms in the codomain of a substitution (with more- 
over _L J_). 

• to multisets of pairs of terms (A) by applying to all terms. 



Figure 4: Definition of parallel reduction relation 
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• If one reduction is by a "Cgr" rule and the other by a "Res" rule, there is one non trivial case: 
suppose t\ (0|<7i|0) <= t (0|ct|0) t® 2 . By induction hypothesis there are t-$ and CT3 such that 
t\ ==> ?3 tj_ and (5\ => G3 -4= 02. Then we can derive t\ (6 \ 0\ |0) =>■ . Finally, by Lemma|2] 
we conclude that t° 2 t^ 3 . 

• If both reductions are by a "Init" rule, then the induction hypotheses apply straightforwardly. 

• Idem for two "Struct" or two "Match" rules. 

• Case where both reductions are by a "Res" rule. Reductions to _L are straightforward. Then 
consider the following case: t° [ <= t (0|a|0) =>■ t° 2 . By induction hypotheses t\ tj 4= tj 
and di =>- 03 <S= o 2 . By Lemma^f^ 1 t° 3 -4= t° 2 . 

□ 

Theorem 2. PPCem is confluent. 

Proof. Since ==> has the diamond property (Lemma [3]), its transitive closure also enjoys the di- 
amond property ([Ter03]). Moreover Lemma [T] implies — >* EM = and then — >* EM enjoys the 
diamond property. Finally, — >em is confluent. □ 

The last two theorems establish a link between the calculus with explicit matching PPCem and the 
original implicit PPC. 

Lemma 4. If \a j$ p\ = p with \x a decided match, then for any po and A there are p' with = p 
and a reduction 

{0\fM>\(a,p)A) (—►.□—►„)* (0\nottn'\A) 
Proof. Induction on ^a /g /?}}. 

• \a /g x]} with x G 8 or \x /e x}}- with x g" 0: immediate. 

• \aa§ Jq ppo\ with aao and ppo matchable forms. Hence a = a n ...a\ and p = p m ...p\ with a n 
and p m constructors. Then a n ...a\ao — >l a n • ... • a\ • a§ and p m ...p\po — >l p m • ... • p\ • pq. 

Suppose n > m, then {{aa /e p/?o]f = $a m ...a n / e p n J l±) §a„_i / e /? n _i}} tfcl ... tfcl |flo/e Ml and 
(0|^. | («n • a ,/7 m • ...» ^>o)A) — ►* {6\no\(a m »...»a n ,p n )(a n -i,p n - 1 )...(ao,po)A). Case on 
Pn = x: 

- If x G 6 then the matching reduces to (0|jUo W{jc i-> a m • ... •a„}|(a n _i, p ra _i)...(ao,/?o)A). 

- If x g" then the matching reduces to (0|jUg|(a re _i,/? n _i)...(ao,/>o)A) with /Xq = /^o or /^q = _L. 

In any of these two cases, the induction hypothesis gives the conclusion. In the case where m > n, 
the same method allows to derive a reduction to _L. 

• Cases of matching failure: for instance \x /$ ytlfc. The following reduction gives the conclusion: 
(6\po\(x,yt)A) — (d\po\(x,y.t)A) ^ m <0|J_|A>. 

□ 

Theorem 3. For any terms t and t' of PPC, if t — >ppc t' then t — >* EM t'. 

Proof. Suppose t — >ppc t'. There is a context C[] such that t = C[([6]p -> b)a] — >ppc C[b'} = t' and 
\a /e p ]f = jli with p a decided match. 

ByLemrna[5]([0],p^% — 6(0|0|(a,/>)) (— >. U ^ m )* b{0\n\9). 
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Case on p: 

• If^ = -LthenZ7' = _Landft(0|_L|0) — > r _L. 

• Else p = o and: 

- If dom(o) = d then V = b a and b(6 1 a|0) 

- ElseZ/ = _Land6(0|_L|0) — > r _L. 



□ 



The map [•] is naturally extended to any PPCem term, set of PPCem terms and decided match, as well 
as the notion of well-formedness. Then, for any p and A not containing any explicit matching, define the 
semantics of the matching (d\p\A) by: 

\e\m = M^[ l+J «W/eWS 

\(a,p)eA 

Note that the semantics can be wait. 

Lemma 5. For any well-formed p, p', A and A' which do not contain any explicit matching, 
if(d\p\A) — {B\n'\A') or (6\n\A) — <0|ju'|A') then [0|ji|A] = [0\n'\A'l 

Proof. Case on the reduction rules. □ 
Lemma 6 (|JK09)). Ift — > PPC t', then t a — > PPC t' a . 

Let t be a PPCem term, and ?' the unique normal form of t by — > p . Write ?4- and call purification of t 
the term . Note that the purification may not be a pure term if there is an unsolvable matching in it. 

Theorem 4. For any well-formed terms t and t' of PPCem, 

ift — >em t' and t\. and t'l are pure, then t\, = t'\ or t\ — > PP c t'\, . 

Proof. Induction on t — >em t'. 

• Case t = ([6]p -> b)a — > p b{6\Q)\(p,a)) = t' . The term t' \, is pure, then there is a sequence 
bl(6\<b\{pi,al)) ( — U — > m )* hi (0|ju|A) — > r t" where \t"\ = t'l and where A = or ii = 1. 
By Lemma[5j [jx\ = ^al /q p\,~§. Then, by case on matching resolution, t\, — > PP c ft"} = t'l. 

• Other base cases: if t — > p t' , then t\. = t'l . 

• Case t = b(6\n\A) — > EM b' (B\n\A) = t'. The term 4 is pure. Then (0|ju|A) — >* p {0\n'\A!) 
where A' = or p! = _L. If p' = _L or dom(p') ^ 6, then t\.= t'\.= _L. Suppose A' = and p' = a 
with dom(a) = 8. Hence t\= (b\.) a and t'\.= (b'l) a . By induction hypothesis b\, — > PP c b'\., 
and then by Lemma [6] t\. — > PP c t'l . 

• Other inductive cases are straightforward. 

□ 

This section introduced the new calculus PPCem for explicit matching with dynamic patterns, and 
proved its confluence. It also expressed a bidirectional simulation between PPC and PPCem'- first any 
reduction of PPC is reflected in PPCem by a sequence. On the other hand, a reduction of PPCem can 
be mapped on zero or one step of PPC if and only if its source and its target are well-formed and can be 
purified. Next section discusses how this new calculus can be used. 
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3 Discussion 

3.1 Reduction Strategies 

Pattern matching raises at least two new issues concerning reduction strategies (i.e. the evaluation order 
of programs). One is related to the order in which pattern matching steps are performed, the other 
concerns the amount of evaluation of the pattern and of the argument performed before pattern matching 
is solved. 

Some remarks about the order of pattern matching steps. 

PPCem uses a multiset as the third component of a matching (6 |;U|A) to represent all the remaining work. 
The calculus is thus able to cover all the possible orders of pattern matching steps. A particular strategy 
may be enforced by giving more structure to the multiset A and by adapting the matching reduction rules. 
Example 3. Suppose that A is now a list of pairs of terms, and (a,p)A denotes the usual "cons": it 
builds the list whose head is (a,p) and whose tail is A. Then the rules of Figure^implement a depth- 
first, left-to-right pattern matching algorithm. 

Example 4. Now assume the list structure of Example^and replace the right member of the reduction 
rule (d\jx\(a\»a2-,p\* Pi)A) — > m {9\n\( a iiPi)(a2,P2)A) by (6\n\A(ai,p\)(a2,P2)}- Then pattern 
matching is done in a completely different order! 

More generally, if some permutations of the elements of A are allowed, lots of richer matching behaviours 
may be described in PPCem- 

Pattern and argument evaluation: what is needed? 

In PPC, a naive evaluation strategy for a term ([6]p -> b)a could be: evaluate the pattern p and the 
argument a, then solve the matching (atomically). As the usual call-by-value, this solution may perform 
unneeded evaluation of the argument, for instance in parts that are not reused in the body b of the 
function. The most basic solution to this problem, call-by-name, allows the substitution of non-evaluated 
arguments. But how can such a solution be described in a pattern calculus? 

In the context of pattern matching, some evaluation of the argument has to be done before pattern 
matching is solved. However the exact amount of needed evaluation depends on the pattern. Hence 
pattern matching enforces some kind of call-by-value where the notion of value is context-sensitive. 
Moreover, even the evaluation of the pattern may depend on the argument! 

This makes the description of a strategy performing a minimal evaluation of the dynamic pattern 
and the argument rather difficult. One may keep for the object-level a compact formalism like PPC 
by defining complex meta-level operations finely parametrised by terms. This is done in MKLR101 to 
describe standard reductions in a simpler pattern calculus. In contrast to this solution, we want to show 
here how the richer syntax of PPCem allows a simple description of such a reduction strategy. 

Indeed PPCem allows to interleave pattern and argument reduction with pattern matching steps. 
This finer control allows for instance an easy definition of a "matching-driven" reduction, as pictured in 
Figure [5] 

The idea here is to trigger pattern matchings as soon as possible. Then the pattern and the argument 
are evaluated until they become matchable, and one or more pattern matching steps are performed be- 
fore the story goes on. A formal definition of a strategy implementing this picture is by restricting the 
reduction under a context to the only four rules given in Figure [6] 

Moreover, it can be checked that the list structure of Example [3] associated with the rules of Figure [3] 
and the context rules of Figure [6] gives a deterministic reduction strategy for PPCem (which means that 
any term has at most one authorised redex). 
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Figure 5: Matching-driven reduction strategy 
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Figure 6: Context rules for matching-driven reduction 
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Figure 7: Partial substitution rules 
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3.2 An Extension: Partial Substitution 

Relaxing the matching procedure generates new possibilities of evaluation, which may bring more partial 
evaluation, more sharing or more parallelism. We explore here an extension of PPCem where the partial 
result of a matching can be applied to the function body before the matching process is completed. 

Example 5. Consider the following reduction: 

(M*z-(([0]* -*)£))(&) 

^B (Mjfe->(*(0|0|(£,jc)}))(&) 

The matching (0|0|(c,Jt)) is blocked because of the presence of the variable x in the pattern. Still, the 
external application can be evaluated: 

(M0|0|(C,*)» (*|0| 

—►2 (fc(0|0|(c,x)))(*|0|(c.M»z)> 

^ m (b(Q)\®\(c,x)))(x\<d\(c,x)(t,z)) 
^ m (fc(0|0|(c,*))) (x\{x^c}\(t,z)) 

Now, the external matching (x\{x>- 1 £}\(t,z)) is also blocked because of the variable z- However, its 
partial result is a substitution for x which, if applied, may unlock the internal matching. Indeed, allowing 
this partial substitution could lead to a reduction like: 

— ► (M0|0|(c,c)))(x|{x^c}|(?,z)) 
^ m (6(0|0|0})(x|{x^c}|(/,z)) 
— > r b{x\{x^c}\(t,z)) 

where the internal matching is finally solved! 

This kind of power may be of interest in two situations: 

• By allowing more reduction in open terms, we gain more partial evaluation capabilities. This may 
be interesting for greater sharing and efficient evaluation MHG91L 

• Suppose now that z is replaced in the example by a possibly big term. In a parallel implementation 
we could complete the external matching and evaluate the internal one in parallel. As pointed out 
in BFMS06I . this might represent another gain in efficiency. 

A light variation on PPCem gives this new power to our formalism. The principle of this variant is to 
systematically apply partial results (substitutions) as soon as they are obtained. Hence they do not need 
to be remembered in the object representing ongoing matching operations. Only a list of used variables 
is remembered for linearity verification. 

The object representing a matching is now (0|t|A) where z is either _L or the list of the names of 
the matching variables that have already been used. Now the test of disjoint union of substitutions is 
replaced by a simple test against T, while the final check compares 6 and T. 

Initialisation, structural application, and most matching rules are the same in this variant. The only 
differences are for the first matching rule and the resolution rules, which are now as in Figure [7] 

Any PPCem term can be translated into a term of this new calculus by applying the following trans- 
formation: turn any b (0\o\A) into b a (6\dom(a)\A) (there is nothing to change in a failed matching). 

The simulation between PPCem and this extension is only one way: any reduction of PPCem is 
mapped by the previous morphism to a reduction sequence, but the converse is not true. Indeed the 
calculus with partial substitution allows new reductions, as pictured in Example [5] Confluence for this 
variant seems to be provable using the same technique as for plain PPCem- 
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Conclusion 

The Pure Pattern Calculus is a compact framework modelling pattern matching with dynamic patterns. 
However, the conciseness of PPC is due to the use of several meta-level notions which deepens the 
gap between the calculus and implementation-related problems. This contribution defines the Pure Pat- 
tern Calculus with Explicit Matching, a refinement which is confluent and simulates PPC, and allows 
reasoning on the pattern matching mechanisms. 

This enables a very simple definition of new reduction strategies in the spirit of call-by-name, which 
is new in this kind of framework since the reduction of the argument of a function depends on the pattern 
of the function, pattern which is itself a dynamic object. In the same direction, it would be interesting to 
express standardisation in pattern calculi (as presented for example in KLRIOID using explicit matching. 
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